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Abstract 

Recent research has documented a significant rise in the volatility (e.g., 
expected squared change) of individual incomes in the U.S. since the 1970s. 
Existing measures of this trend abstract from individual heterogeneity, effec- 
tively estimating an increase in average volatility. We decompose this increase 
in average volatility and find that it is far from representative of the experi- 
ence of most people: there has been no systematic rise in volatility for the 
vast majority of individuals. The rise in average volatility has been driven 
almost entirely by a sharp rise in the income volatility of those expected to 
have the most volatile incomes, identified ex-ante by large income changes in 
the past. We document that the self-employed and those who self-identify as 
risk-tolerant are much more likely to have such volatile incomes; these groups 
have experienced much larger increases in income volatility than the popula- 
tion at large. These results color the policy implications one might draw from 
the rise in average volatility. While the basic results are apparent from PSID 
summary statistics, providing a complete characterization of the dynamics of 
the volatility distribution is a methodological challenge. We resolve these dif- 
ficulties with a Markovian hierarchical Dirichlet process that builds on work 
from the non-parametric Bayesian statistics literature. 
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1 Introduction 



A large literature argues that income volatility - the expectation of squared individual 
income changes - has increased substantially since the 1970s in the U.S., with further 
increases since the 1990sj^] To the degree that people are risk-averse and income 
volatility is taken as a proxy for risk, ceteris paribus such rising volatility may carry 
substantial welfare costs. As a consequence, there has been a great deal of recent 



interest by politicians and journalists in this finding. (Gosselin, December 12, 2004 



Scheiber, Decemer 12, 2004 Hou January 31, 2007) 



To date, research on income volatility trends has ignored individual heterogeneity, 
effectively estimating an increase in average volatility. We decompose this increase 
in the average and find that it is far from representative of the experience of most 
people: there has been no systematic increase in volatility for the vast majority of 
individuals. The increase has been driven almost entirely by a sharp increase in the 
income volatility of those with the most volatile incomes. In turn, we find that these 
individuals with high - and increasing - volatility more likely to be self-employed and 
more likely to self-identify as risk-tolerant. 

Our main finding is apparent in simple summary statistics from the PSID. For 
example, divide the sample into cohorts, comparing the minority who experienced 
very large absolute one-year income changes in the past (e.g., four years ago) to those 
who did not. Since volatility is persistent, those identified ex- ante by large past 
income changes naturally tend to have more volatile incomes today. The income 
volatility of this group identified ex-ante as high-volatility has increased since the 



^Dahl, DeLeire, and Schwabish| ( [2007 1 is a noteable exception. Dynan, Elmendorf, and Sichel 
(2007) provide an excellent survey of research on this subject in their Table 2, including Gottschalk 



and Moffitt|fll994P;|Moffitt and Gottschalk| ( |1995|); |Daly and Duncan| (|1997|); pynarski and Gruber 
rtl997t); ICameron and Tracyl (119981); IHaiderl (I2001I); IHyslopI (120011); IGottschalk and Moffittl (120021); 



Batchelder| (|2003| ) ; |Hacker| Q2006[); |Comin and Ra bin (200 6]); |Gottschalk and Moffitt| (|2006|; |Hertz 



(2006); Winship 



Schwabish (20071. See also Shin and Solon (2008) 



(2007); Bollinger and Ziliak (2007); Bania and Lcetc (2007); Dahl, DeLeire, and 
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1970s while the income volatility of others has remained roughly constant]^] This 
divergence of sample moments identifies our key result. 

Obviously, these findings could affect substantially the welfare and policy implica- 
tions of the rise in average volatility. The individuals whose volatility has increased 
- who we find are those with the most volatile incomes - may be those with the 
highest tolerance for risk or the best risk-sharing opportunities. Such risk tolerance 
is apparent not only from the willingness of these individuals to undertake volatile 
incomes or self-employment in the first place, but also from their answers to survey 
questions. 

While the basic results can be seen in summary statistics, providing a complete 
characterization of the dynamics of the volatility distribution is a methodological 
challenge. We use a standard model for income dynamics that allows income to 
change in response to permanent and transitory shocks. What is less standard is 
that we allow the variance of these shocks - our income volatility parameters - to be 
heterogeneous and time-varying. 

We estimate a discrete non-parametric model in which volatility parameters are 
assumed to take one of L unique values, where the number L and the values themselves 
are determined by the data. We add structure and get tractability with a variant 
on the Dirichlet process (DP) prior commonly used in Bayesian statistics. The 
Markovian hierarchical DP prior model we develop accounts for the grouped nature 
of the data (by individual) as well as the time-dependency of successive observations 
within individuals. Implicitly, we place a prior on the probability that an individual's 
parameter values will change from one year to the next, on the number of unique 



Our finding is consistent with Dynan, Elmendorf, and Sichel (20071 who find that increasing 
income volatility has been driven by the increasing magnitude of extreme income changes, by the 
increasingly fat tails of the unconditional distribution of income changes. The fat tails of the 
unconditional distribution of income changes has also been documented in Geweke and Keane ( 2000 1 . 
In its reduced form, our paper shows that these increasingly fat tails are borne largely by individuals 
who are ex-ante likely to have volatile incomes. The increasingly fat tails of the unconditional 
distribution are not attributable - or at least not solely attributable - to increasingly fat tails of the 
expected distribution for everyone. 
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parameter values an individual will hold over his lifetime, and on the number of 
unique parameter values found in the sample. 

In Section |2j we discuss our data and the summary statistics that drive our results. 
In Section [3j we present our statistical model including the income process (Section 
3.1] ), the structure we place on heterogeneity and dynamics in volatility parameters 



(Section 3.2), and our estimation strategy (Section 3.3). In Section [4] we show the 
results obtained by estimating our model on the data. Increases in the average 
volatility parameter are due to increases in volatility among those with the most 



volatile incomes (Section 4.2). We find that the increase in volatility has been 
greatest among the self-employed and those who self-identify as risk-tolerant (Section 



4.5), and that these groups are disproportionately likely to have the most volatile 



incomes (Section 4.4). Increases in risk are present throughout the age distribution, 



education distribution, and income distribution (Section 4.5). Section [5] concludes 
with a discussion of welfare implications. 



2 Data and summary statistics 
2.1 Data and variable construction 

Data are drawn from the core sample of the Panel Study of Income Dynamics (PSID). 
The PSID was designed as a nationally representative panel of U.S. households. It 
tracked families annually from 1968 to 1997 and in odd-numbered years thereafter; 
this paper uses data through 2005. The PSID includes data on education, income, 
hours worked, employment status, age, and population weights to capture differential 
fertility and attrition. In this paper, we limit the analysis to men age 22 to 60; 

we use annual labor income as the measure of income]^] Table [T] presents summary 

3 Labor income in 1968 is labeled v74 for husbands and has a constant definition through 1993. 
From 1994, we use the sum of labor income (HDEARN94 in 1994) and the labor part of business 
income (HDBUSY94), with a constant definition through 2005. Note that data is collected on 
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Table 1: Summary Statistics 
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This table summarizes data from 52,181 observations on 3,041 male household heads. 

statistics from these data. 

We want to ensure that changes in income are not driven by changes in the top- 
code (the maximum value for income entered that can be entered in the PSID). 
The lowest top code for income was $99,999 in 1982 ($202,281 in 2005 dollars), after 
which the top-code rises to $9,999,999. So that top-codes will be standardized in real 
terms, this minimum top-code is imposed on all years in real terms, so the top-code 



is $99,999 in 1982 and $202,281 in 2005. Since our income process in Section 3^ 
does not model unemployment explicitly, we need to ensure that results for the log of 
income are not dominated by small changes in the level of income near zero (which 
will imply huge or infinite changes in the log of income). To address this concern, 
we replace income values that are very small or zero with a non-trivial lower bound. 
We choose as this lower-bound the income that would be earned from a half-time job 
(1,000 hours per year) at the real equivalent of the 2005 federal minimum wage ($5.15 
per hour). This imposes a bottom-code of $5,150 in 2005 and $2,546 in 1982. Note 
that the difference in log income between the top- and bottom-code is constant over 



household "heads" and "wives" (where the husband is always the "head" in any couple). We use 
data for male heads so that men who are not household heads (as would be the case if they lived 
with their parents) are excluded. 
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Table 2: Distribution of Income, Excess Log Income, and Income Changes for Men 
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$62,012 


0.4601 
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Table [2] describes the distribution of labor income for men in the PSID over the period from 1968 to 
2005. See Section|2]for a detailed description of the income variable and the top- and bottom-coding 
procedure. Column 1 shows the distribution of real annual income for men (in 2005 dollars). The 
numbers in parentheses are the values with top- and bottom-coding restrictions. Column 2 shows 
the distribution of "excess" log income, the residual from the regression of log labor income (with 
top- and bottom-code adjustments) on the covariates enumerated in Section [2] Column 3 presents 
the distribution of one- year changes in excess log income. Column 4 repeats the results for column 
3, but presents five-year changes instead of one-year changes. 

time, so that differences over time in the prevalence of predictably extreme income 
changes cannot be driven by changes in the possible range of income changes. The 
vast majority of the values below this bound are exactly zero. This bound allows us 
to exploit transitions into and out of the labor force. At the same time, the bound 
prevents economically unimportant changes that are small in levels but large and 
negative in logs from dominating the results. Results are robust to other values for 
this lower bound, such as the income from full-time work (2,000 hours per year) at 
the 2005 minimum wage (in real terms) n 



4 The Winsorizing strategy employed here is obviously second-best to a strategy of modeling a 
zero income explicitly. Unfortunately, such a model is not feasible given the complexity added 
by evolving and heterogeneous volatility parameters. The other alternative would be simply to 
drop observations with low incomes, though we view this approach is much more problematic in our 
context; it would explicitly rule out the extreme income changes that are the subject of this paper. 
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In this paper, we model the evolution of "excess" log income. This is taken as 
the residual from a regression to predict the natural log of labor income (top- and 
bottom-coded as described). The regression is weighted by the PSID-provided sample 
weights, with the weights normalized so that the average weight in each year is the 
same. We use as regressors: a cubic in age for each level of educational attainment 
(none, elementary, junior high, some high school, high school, some college, college, 
graduate school); the presence and number of infants, young children, and older 
children in the household; the total number of family members in the household, and 
dummy variables for each calendar year. Including calendar year dummy variables 
eliminates the need to convert nominal income to real income explicitly. While this 
step is standard in the income process literature, it is not necessary to obtain our 
results. The results to follow are qualitatively the same and quantitatively similar 
when we use log income in lieu of excess log income. 

Table [2] presents data on the distribution of real annual income in column 1 (im- 
posing top- and bottom-code restrictions in parentheses). While the mean real income 
is nearly identical with and without top- and bottom-code restrictions ($50,553 versus 
$48,867), these restrictions on extreme values reduce the standard deviation of real 
income from $57,506 to $34,943. Column 2 shows the distribution of "excess" log 
income. Since excess log income is the residual from a regression, its mean is zero. 
The inter-quartile range of excess log income is —0.30 to 0.46. 

Column 3 presents the distribution of one-year changes in excess log income. 
Naturally, the mean of one-year changes is close to zero. The inter-quartile range 
of one-year changes is —0.11 to 0.14; excess income does not change more than 11 
to 14 percent from year to year for most individuals. However, there are extreme 
changes in income, so the standard deviation of changes to log income (0.49) is far 
great than the inter-quartile range. This implies either that changes to income have 
fat tails (so that everyone faces a small probability of an extreme income change), 
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or alternatively that there is heterogeneity in volatility (so that a few people face a 
non-trivial probability of an extreme income change). Unless a model is identified 
from parametric assumptions, these are observationally equivalent in a cross-section 
of income changes. However, heterogeneity and fat tails have different implications 
for the time-series of volatility, and we exploit these in the paper. 

Column 4 repeats the results from column 3, but presents five-year excess log 
income changes instead of one-year changes. These long-term changes have only 
slightly higher standard deviations than the one-year change, 0.69 vs. 0.49, suggesting 
some mean-reversion in income. Abowd and Card (1989) show that while one-year 



income changes are highly negatively correlated at one- year lags, there is no evidence 
of autocorrelated income changes at lags greater than two years. 

2.2 Volatility summary statistics 

Table [3] shows the evolution of volatility sample moments over time. The first three 
columns show the variance of permanent income changes]^] The final three columns 
present two-year squared changes in excess log income, a raw measure of income 
volatility]^] Note that while the mean size of an income change (columns 1 and 4, 
Table [3]) has increased over time, the median (columns 2 and 5) has not. This 
divergence can be explained by an increase in the magnitude of large unlikely income 
changes (columns 3 and 6). While not framed in this way, these features of the data 



have been identified in previous research, including Dynan, Elmendorf, and Sichel 



5 The variance of permanent income changes is the individual-specific product of two-year changes 
in excess log income (for example, between years t and t — 2) and the six-year changes that span 



them (for example, between years t + 2 and t — 4) . Meghir and Pistaferri ( 2004 ) show that this 



moment identifies the variance of permanent income changes (between years t-2 and t) under fairly 



general conditions, including the income process we use in Section 3.1 

6 A11 use weights from the PSID. The first row shows whole-sample results. The second row 
shows the percent change in the mean, median, or 95 th percentile over the sample. This is merely 
calculated as coefficient of a weighted OLS regression of the year-specific sample moment on a time 
trend, multiplied by the number of years (2005 — 1968) and divided by the whole-sample value in the 
previous row. The coefficient and t-statistic from this regression are shown just below. Year-by-year 
values arc then shown. 
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Table 3: Income Volatility Sample Moments 
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The year t permanent variance is the product of two-year changes in excess log income (from t — 2 
to t) and the six-year changes that span them (from t — 4 to t + 2). The year t squared change 
is from t — 2 to t. The hrst row shows full sample moments. The second row shows the percent 
change over the sample, calculated as the coefficient of a weighted OLS regression of year-specific 
sample moments on a time trend, multiplied by the number of years (2005-1968) and divided by the 
full sample moment. The coefficient and t-statistic are shown below. 
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Figure 1: Comparing Sample Variances for Those With and Without Large Past 
Income Changes 
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Following Meghir and Pistaferri (2004 1, the sample permanent variance is calculated as the product 
of two-year changes in excess log incomes (between years t and t-2) and the six-year changes that 
span them (between years t+2 and t-4). The sample transitory variance is calculated as the square 
of two-year changes in excess log income. Individuals arc defined as low past variances when their 
sample variance (permanent or transitory, respectively) four years ago is below median; individuals 
arc defined as high past variance when their sample variance four years ago is above the 95 th 
percentile. Weighted averages for these groups are presented in each year for which data is available 
for permanent variance (left panel) and transitory variance (right panel). 



( [2007fr . 

Table [4] and Figure [T] show the evolution of volatility sample moments separately 
for those who are ex-ante likely or unlikely to have volatile incomes. The left panel 
of Table [4] presents the sample mean of the permanent variance; the right panel 
presents the mean two-year squared excess log income change. For each year, the 
sample is split into two groups (below median or above 95 th percentile) based on the 
absolute magnitude of permanent (left panel) or squared (right panel) changes four 
years prior. Unsurprisingly, individuals with large past income changes tend to have 
larger subsequent income changes. The tendency to have large income changes is 
persistent, which indicates that some individuals have ex- ante more volatile incomes 
than others. 
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Table 4: Income Volatility Sample Moments by Past Volatility 
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The year t permanent variance is the product of two-year changes in excess log income (from t — 2 
to t) and the six-year changes that span them (from t — 4 to t + 2). The first and third columns 
show sample means for the cohort of individuals whose permanent variance and squared change, 
respectively, were below median in the year four years prior. The second and fourth columns show 
the same, but for the cohorts with past values above the 95 th percentile four years prior. The first 
row shows full sample moments. The third and fourth rows present the coefficient and t-statistic 
from a weighted OLS regression of year-specific sample means on a time trend. The difference in 
these two coefficients, divided by their average, is the % difference in the second row. Year-by-year 
means are shown below. 
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If (as we argue) volatility is increasing for high-volatility individuals but not for 
low- volatility individuals, then the gap in the sample variance between those with and 
without large past income changes should be increasing over time. This divergence 
over time in volatility between past low- and high-volatility cohorts is clear in both 
Table [4] and Figure [TJ The magnitude of income changes has been increasing more 
for those with large past income changes (who are more likely to be inherently high- 
volatility) than for those without such large past income changes (who are not). This 
is particularly apparent for the permanent variance; for the transitory variance, the 
finding is obscured slightly by the jump in volatility for everyone in the early- to 
mid-nineties (when the PSID changed to an automated data collection system which 
may have led to increased measurement error in income). This divergence illustrates 
the key stylized fact developed in this paper: the increase in income volatility can be 
attributed to an increase in volatility among those with the most volatile incomes, 
identified ex-ante by large past income changes. 



3 Statistical model 



3.1 Income process 



Here, we present a standard process for excess log income for individual % at time t 
(following |Carroll and Samwick , 1997 Meghir and Pistaferri 2004 and many others): 



Vi,t 
Pi,t 



p%,t + &,t + e i,t 

Pi,o + yj Wi,k + yj </>w,t-kWi t k- 



a; 



/c=i 



k=t— + 1 



i.t 



4>e,t-k£i,k 

k=t—q e +l 
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Excess log income {y^t) is the sum of permanent income (j>i,t), transitory income 
(£i,t), and measurement error (e^). The permanent shock, transitory shock, and 
measurement error are assumed to be normally distributed with mean zero as well 
as independent of one another, over time and across individuals. Permanent income 
is initial income (pi,o) plus the weighted sum of past permanent shocks (u^k, < 
k < t) with variance <J^ it = E [ujf t ~\ . Transitory income is the weighted sum of 
recent transitory shocks with variance o\ ii = E [e? f ] . We refer to a\ 



it 



{plit^tit) jointly as the volatility parameters. These will be allowed to differ 
between individuals to accommodate heterogeneity, and to evolve over time. This 
accommodates not just an evolving distribution of volatility parameters, but also 



systematic changes over the life-cycle in volatility paramters, as suggested by Shin 



and Solon (2008). Subcripts for % and t indicate that volatility parameters may differ 



across individuals and over time, as discussed in Section |3T2) "Noise variance" refers 
to the variance of measurement error, 7 2 = E [e? t ] . This measurement error could 
be subsumed into transitory income; it is kept separate only to accommodate our 
estimation strategy. 

Here, permanent shocks come into effect over periods, and transitory shocks 
fade completely after q e periods]^] As an example of our notation, <^ )2 denotes the 
weight placed on a permanent shock from two periods ago, in current excess 

log income; £) 2 denotes the weight placed on a transitory shock from two periods ago, 
Ei y t-2, in current excess log income. While we use the word "shock" for parsimony, 
these innovations to income may be predictable to the individual, even if they look 
like shocks in the data. Without loss of generality, we impose the constraint that 
the weights placed on transitory shocks sum to one {^2 k <fi £ ,k = !)• 



7 In 



Carroll and Samwick 



(19971, 4> w ^k — 4>ek = is assumed for k > 0, though the authors 



acknowledge that this assumption is unrealistic and design an estimation strategy that is robust to 



this restriction but do not estimate <pk- In Meghir and Pistaferri (2004) and Blundcll, Pistaferri, 



and Preston (forthcoming), <^ w fe = is assumed for k > but 4> e ± — is not. 



12 



3.2 Heterogeneity and dynamics 

We characterize the dynamics of volatility parameters, af t , using a discrete non- 
parametric approach. In a discrete non-parametric model, the variable of interest - 
here, the pair af t = {af { t , cr^ it ) - can take one of L possible values, {af}f =1 (where L 
and {af}f =1 for any given sample are determined by the data). The probability that 
af t takes a given value is a function of a) the distribution of values in the population, 
{lb}, where 11; is the proportion of the population whose parameter values are equal 
to of, b) the distribution of values for each individual i, {11^}, where IT^ is the 
proportion of individual Vs observations with parameter values are equal to of,, and 
c) the number of consecutive years Q^t with the most recent valuej^] In other words, 
of t has a given probability of changing from one year to the next; when it changes, 
it changes to a value drawn from the individual's distribution, {lb-}, which in turn 
consists of values drawn from the population distribution, {lb}. 

We add structure and get tractability by adding a prior commonly used in Bayesian 
analysis of such discrete non-parametric problems: the Dirichlet process (DP) prior. 
In a standard DP model, there is a "tuning parameter", B, which implicitly places 
a prior on the total number of unique parameter values in the sample, Lrj is 



defined more formally in Section 3.3 We set = 1, though our inference is not 



sensitive to this choice. In a hierarchical DP (HDP) model (recently developed by 



Teh, Jordan, Beal, and Blei 2007), the usual DP model is extended so by adding a 
second tuning parameter, 0j, which implicitly places a prior on the total number of 
unique parameter values for any given individual, Lf, we set 0j = 1. 

We extend this approach further to address panel data by including a Markovian 
structure on the hierarchical DP, giving us a Markovian hierarchical DP (MHDP) 
model. In our Markovian approach, the prior probability that the parameter is 



i.t- 



9 



Qi : t is the largest value satisfying af t _ 1 — of t _ q for all < qu < Q 

In large samples the expected number of unique values is of the order 61og((7V + 6)/6) where 



N is the number of observations. (Liu 19961 
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unchanged from the previous period depends on the number of consecutive years 
with that value, Q^t- We add a third tuning parameter, 9, to place a prior on the 
probability of changing the parameter value, p (of f = of t-1 \i, t) = Qi,t/{9 + Qi,t)\ we 
set 9 = 1. In the MHDP model, our prior parameters can then be characterized with 
the triple 6 = {6, Q it 9} = {1,1, 1}. 

Given our research question, a key advantage of this set-up is that it does not 
restrict the shape (or the evolution of the shape) of the cross-sectional volatility 
distribution. We view our discrete non-parametric model and the structure placed 
on it by our MHDP prior as providing a sensible middle ground between tractability 
and flexibility. 



3.3 Estimation 



We estimate the income process from Section 3A on annual data from the PSID 
(detailed in Section [2]) for excess log income. When data are missing, mostly be- 
cause no data was collected by the PSID in even-numbered years following 1997, we 



impute bootstrapped guesses of income, These bootstrapped values add no ad- 



ditional information; they merely accommodate our estimation strategy in a setting 
with missing data in a way that is intended to minimize the possible impact on our 



results. Here, we outline an approach for combining the prior from Section 3.2 with 



data on excess log income, y, to form a posterior on the distribution of volatility 



10 Wc examine the two-year change in excess log income that spans any single- year of missing data. 
We identify the set of two-year excess log income changes with a similar magnitude elsewhere in the 
data and select one at random. This bootstrapped draw has an intermediate value which is used to 
fill in the missing data. For example, consider an individual with excess log income of 0.1 in 1999, 
0.5 in 2001 and (since the PSID did not gather data in the intervening year) missing in 2000. From 
the set of all sample observations with two-year excess log income changes in the neighborhood of 
0.4, we select one at random. In general, this observation will be drawn from a different individual 
than the one with the missing data. Imagine that the individual-years drawn at random have excess 
log incomes of 0.6, 0.7, and 1.0 in 1972, 1973, and 1974, respectively. We then fill in the original 
individual's missing data in 2000 with 0.2 (0.1+0.7-0.6). We drop individuals with longer spans of 
missing data. 
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parameters, 0" 2 {^j] Further details and an algorithm for implementation are provided 
in the appendix. 

Consider the problem of estimating af t , the volatility parameters for person % in 
year t, if all other parameters &-(it) (and (j>) were known. The decision tree for 
estimation is shown in Figure [2] and described here, both with references to relevant 
equations in the appendix. 



Level 1 <j\ t can remain unchanged from last year (af t = cr? t _ l5 eq: [7j) or can change 
+ °lt-\y ec i : |sb- If a tt changes; 



Level 2 of t can change to a value from the set of other values for that individual (of t G 
<rf_ t and af t ^ o~\±-\, eq: p| or can take on a value new to the individual 



af t of_ t , eq: 10). If af t takes on a value new to the individual; 



Level 3 af t can be a value held by other individuals (of t G 0-(it) an d o~f t £ of _ ( - eq: 
1 1 ) or can be a new value not shared with other individuals (of t ^ cr 2 ^ t j , eq: 



12). 



The probability that o~f t takes a given value is a function of a) the likelihood of 
generating estimated shocks (u ijt , e^t) given af t and b) the prior probability of af t . 

The prior probability that the parameter remains unchanged in Level 1 (of 4 = 
ofi_i) is proportional to Qj )t ; the prior probability that the parameter changes is 
proportional to 8. If the parameter changes in Level 1 (of t ^ 

o~it-i)-> the prior 

probability that of f changes to a value held by that individual in another year in 
Level 2 is proportional to the number of times that value occurs in other years for 
that individual; the prior probability that a\ t changes to a new value not seen for 
that individual in another year is proportional to Oj. If the parameter changes to a 
new value not seen for that individual in another year in Level 2, the prior probability 



u y is the ragged N by T+l matrix, with yn in the i-th row of the i+l-th column, a 2 = {c^ "?} 
is the pair of ragged N by T matrices, with of, i t and a\ i t in the i-th row of the £-th column of a 2 



and , respectively. 
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Figure 2: Model Hierarchy 
initial value 




no change (j7l) 



reverts to a 
previous value Q 



others share 



new value 



o individual) (110 



new value 



this value (11) 



(to sample) (12) 



Diagram describes evolution of volatility parameters. The numbers 1, 2, and 
3 in circles at each decision node correspond to the levels of the hierarchy 
described on page 15 The numbers ^ through (12) identify the equation 



number giving the probability of reaching that branch. 



that af t changes to one of the other population values in Level 3 is proportional to 
the number of times that value occurs within the population; the prior probability 
that af t changes to a new value not seen elsewhere in the population is proportional 
to 0. 

A detailed outline of this estimation algorithm is given in the appendix. The 
appendix shows this compound prior algebraically, and also shows how it is combined 
with the data to produce a posterior for af t . We proceed iteratively through all 
t within an individual and all i across individuals. This entire scheme for choos- 



ing volatility values o 2 is nested within a larger Gibbs sampling algorithm (Geman 



and Geman, 1984). This Markov Chain Monte Carlo (MCMC) approach simultane- 



ously estimates the other parameters of our model, namely shocks (oj,e) and income 
coefficients ((f), r y 2 ). 
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Table 5: Basic Model Results 



1 llffvi r~\l 1 T 

Ul&ll lDUX 


ion of Variance Parameters 




Permanent 


Transitory 




Variance 


Variance 


\\/\ onn 


0.0713 


0.2771 


OL. JJcv. 


0.4685 


1.0471 


N 

IN 


67,725 


67,725 


1 st % 
1 /o 


0.0200 


0.0499 


5 th % 


0.0250 


0.0506 


10 th % 


0.0301 


0.0510 


25 th % 


0.0313 


0.0518 


50 th % 


0.0321 


0.0530 


75 th % 


0.0331 


0.0572 


90 th % 


0.0356 


0.2452 


95 th % 


0.0498 


1.2187 


99 th % 


0.8909 


5.5030 



Shocks' Rate of Entry/Exit 



Distribution of posterior means of a 



lag 


• 


4>uj,k 


<Pe,k 


k = 





0.381 


0.784 






(0.088) 


(0.029) 


k = 


1 


0.865 


0.180 






(0.072) 


(0.025) 


k = 


2 


0.951 


0.037 






(0.064) 


(0.017) 



from k periods ago 
4> E ,k- impact of transitory shock 

from k periods ago 
Standard errors in parentheses. 



The left panel presents the posterior mean estimates of the volatility parameters, a 2 . The distri- 
butions presented here consider all years and all individuals together. The right panel of this table 
present the mapping of shocks to income changes. 



4 Results 



Here, we present the model parameters estimated using the methods from Section 
3.3| The chief object of interest is the evolution of the cross-sectional distribution 

We begin 



of volatility parameters, 07, over time. These are shown in Section 



4.2 



with more basic results. In subsection |4.1[ we present estimates of the homogeneous 
parameters (j> that map shocks to income changes and the unconditional distribution 



of volatility parameters, a . In Section 4.3, we rule out alternative explanations. 



In Sections AA and 4J3, we map these volatility parameter estimates to individuals' 
demographic or risk attributes. 
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Figure 3: Distribution of Permanent and Transitory Variance 
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This figure presents the distribution of erf and cr^ . These are the distribution of posterior means 
estimated from the data, as presented numerically in Table [5] These posteriors of the permanent 
variance and transitory variance are calculated for each individual in each year, as described in 
Section |3.3| The distributions presented here show all years and individuals together. Values 
are truncated at the 95 th percentile for the permanent variance and at the 90th percentile for the 
transitory variance. Mean and median of the truncated part of each distribution is given. 



4.1 Basic results 

Table [5] presents the basic parameter estimates obtained from fitting our model to the 
PS1D income data described in Section pO The left panel shows the distribution of 



risk in the population, <r| and <xf,. Formally, we present the distribution of posterior 
means of permanent and transitory variance parameters. The right panel show the 
mapping from shocks to income changes, <j), which we constrained to be constant over 
time and across individuals. 

Note the extreme skew and fat tails (kurtosis) in the distribution of volatility 
parameters, a 2 , shown in the left panel of Table [5J. While medians are modest, means 
far exceed medians. At the median, transitory shocks have a standard deviation of 
approximately 23% annually; permanent shocks have a standard deviation of just 
under 18% annually. However, the highest volatility observations imply shocks with 
standard deviations well above 100% annually. Figure [3] plots these skewed and 



Figure 4: Impulse Response Function 



Permanent Shock 




o -I 

12 3 

years since shock 



This figure presents an estimated 
a permanent (left panel) and 



Permanent and Transitory Shocks 



Transitory Shock 




o 1 2 3 

years since shock 



impulse response function for 
transitory (right panel) shock. 



fat-tailed distributions by truncating the right tail. 

As shown in the right panel of Table [5j permanent shocks enter in quickly (<j> u> k 
are close to one) while transitory shocks damp out quickly (<p £ ,k fall to zero). The 
impact of a shock on the evolution of income is presented in Figure |4j These present 
impulse response functions for a permanent (left panel) and transitory (right panel) 
shock. Shocks were calibrated as a one standard-deviation shock for an individual 
with volatility parameters at the estimated means (pulled from Table [5]). 

4.2 Evolution of the volatility distribution 

Here, we show how the distribution of posterior means of variance parameters has 
evolved over time. This evolution is shown in Tables [6] and also in Figure [5} Table 
[6] shows the year-by-year distribution of volatility parameters (of) posterior means. 
This table mirrors Table [3j with volatility parameter (of t ) posterior means replacing 
reduced form moments. The first three columns show results for the permanent vari- 
ance parameter, crj; the final three columns show results for the transitory variance 
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Table 6: Year-by-Year Income Volatility Parameters 



Permanent Variance, ctJ Transitory Variance, o'^ 





Mean 


Median 


95 th % 


Mean 


Median 


95 th % 


Average 


0.0713 


0.0321 


0.0498 


0.2771 


0.0530 


1.2186 


% Change 


73% 


0% 


71% 


99% 


1% 


154% 


Slope 


0.0014 


0.0000 


0.0010 


0.0074 


0.0000 


0.0508 


(t-stat) 


(6.84) 


(3.78) 


(6.31) 


(7.02) 


(9.37) 


(6.25) 


1970 


0.0573 


0.0321 


0.0424 


0.1568 


0.0526 


0.4498 


1971 


0.0502 


0.0321 


0.0406 


0.1901 


0.0526 


0.6419 


1972 


0.0411 


0.0320 


0.0374 


0.1909 


0.0527 


0.7775 


1973 


0.0550 


0.0321 


0.0389 


0.2027 


0.0528 


0.7997 


1974 


0.0481 


0.0322 


0.0437 


0.1848 


0.0528 


0.5520 


1975 


0.0547 


0.0321 


0.0397 


0.1923 


0.0530 


0.7597 


1976 


0.0663 


0.0321 


0.0464 


0.2746 


0.0529 


1.3527 


1977 


0.0540 


0.0321 


0.0409 


0.2424 


0.0529 


1.1020 


1978 


0.0557 


0.0321 


0.0411 


0.1865 


0.0529 


0.6785 


1979 


0.0738 


0.0321 


0.0432 


0.2226 


0.0528 


1.0134 


1980 


0.0748 


0.0321 


0.0452 


0.2012 


0.0529 


0.7139 


1981 


0.0651 


0.0321 


0.0504 


0.1986 


0.0529 


0.7762 


1982 


0.0594 


0.0321 


0.0502 


0.2055 


0.0529 


0.8885 


1983 


0.0744 


0.0321 


0.0457 


0.2550 


0.0531 


1.2691 


1984 


0.0660 


0.0321 


0.0503 


0.2307 


0.0531 


0.9686 


1985 


0.0593 


0.0321 


0.0477 


0.2260 


0.0530 


1.0063 


1986 


0.0672 


0.0321 


0.0441 


0.2557 


0.0529 


1.1042 


1987 


0.0679 


0.0321 


0.0477 


0.2448 


0.0530 


1.1468 


1988 


0.0714 


0.0321 


0.0467 


0.2286 


0.0531 


0.9494 


1989 


0.0629 


0.0321 


0.0490 


0.2462 


0.0529 


1.3182 


1990 


0.0801 


0.0321 


0.0607 


0.2387 


0.0530 


0.9812 


1991 


0.0726 


0.0321 


0.0600 


0.2708 


0.0530 


1.2466 


1992 


0.0633 


0.0321 


0.0539 


0.2431 


0.0531 


1.0536 


1993 


0.0887 


0.0321 


0.0701 


0.4290 


0.0532 


2.6502 


1994 


0.0916 


0.0321 


0.0628 


0.4229 


0.0532 


2.3884 


1995 


0.0764 


0.0321 


0.0583 


0.4080 


0.0532 


2.2152 


1996 


0.0609 


0.0321 


0.0541 


0.4167 


0.0531 


2.4093 


1997 


0.0721 


0.0321 


0.0499 


0.3916 


0.0531 


2.3408 


1999 


0.0769 


0.0321 


0.0519 


0.3059 


0.0532 


1.5679 


2001 


0.0975 


0.0322 


0.0719 


0.2616 


0.0531 


1.0974 


2003 


0.1026 


0.0322 


0.0967 


0.4771 


0.0534 


2.4896 


2005 


0.1294 


0.0324 


0.0592 


0.4379 


0.0538 


2.2246 



The construction of posterior means for of, and erf for each individual in each year is detailed in the 
text. The first row shows the full sample distribution, so that the second column shows the median 
value of the posterior mean of cr^ over a h individual- years. The second row shows the percent 
change over the sample, calculated as the coefficient of a weighted OLS regression of year-specific 
sample moments on a time trend, multiplied by the number of years (2005-1968) and divided by the 
full sample value. The coefficient and t-statistic are shown below. 
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parameter, a e . The first and fourth columns present means of the permanent and 
transitory variance parameter posterior means, the second and fifth columns present 
medians of parameter posterior means, and the third and sixth columns present 95 th 
percentiles. All use weights from the PSID. The first row shows whole-sample 
results. The second row shows the percent change in the mean, median, or 95 th 



percentile over the sample. 12 The coefficient and t-statistic from this regression are 
shown just below. Year-by-year values are then shown. 

Table [6] shows that the mean of permanent and transitory parameters have in- 
creased substantially over the sample (by 73 and 99 percent, respectively) while the 
medians have not (0 and 1 percent increases, respectively). This divergence can 
be explained by an increase in the magnitude of permanent and transitory variance 
parameters at the right tail, among individuals with the highest parameters (the 95 th 
percentile values increasing 71 percent and 154 percent, respectively). Colloquially, 
the kind of people whose incomes had always moved around a lot are moving around 
even more than they used to; the median person's income does not move more than 
it used to. 

This pattern can be seen graphically in Figure [5| which shows the year-by-year 
evolution of many quantiles of the distribution of permanent and transitory variance 
posterior means. In the bottom panels of Figure |5j we plot the 1st, 5 th , 10th, 25 th , 
50th, and 75 th percentile values of the posterior mean of the permanent (a 2 , left) and 
transitory (er 2 , right) variance parameters by year. These are very stable and show 
no clear upward trend. The size of this increase is extremely small economically. 
Looking at all but the "risky" tail of the distributions, the distributions look very 
stable. 

In the middle and upper panels of Figure [5j we show the evolution of the "risky" 

12 This is calculated as coefficient of a weighted OLS regression of the year-specific moments from 
below on a time trend, multiplied by the number of years (2005-1968) and divided by the whole- 
sample value in the previous row. 
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tail of the distribution of posterior means. In this case, variance parameters increase 
strongly and significantly. This increase in the right tail of the distribution explains 
the increase in the mean completely. 

4.3 Heterogeneity or fat tails? 

So far, we have shown that the increases in income volatility can be attributed solely 
to increases in the right tail of the volatility distribution. To obtain this result, our 
model assumes that the distribution of shocks is normal conditional on the volatility 
parameters. When the unconditional distribution of shocks is fat-tailed (has high 
kurtosis), this is automatically attributed to heterogeneity in volatility parameters. 
An alternative hypothesis is that there is little or no heterogeneity in volatility pa- 
rameters, but that shocks are conditionally fat-tailed. 

When looking at the cross-section of income changes, heterogeneity in volatility 
parameters (with conditionally normal shocks) and conditionally fat-tailed shocks 
(without no heterogeneity in volatility parameters) are observationally equivalent; 
they both imply a fat-tailed unconditional distribution of income changes. By ex- 
amining serial dependence, it is possible to reject the hypothesis that everyone has 
the same volatility parameter. If shocks are conditionally fat-tailed but everyone 
has the same volatility parameters, then those with large past income changes should 
be no more likely than others to experience large subsequent income changes. If 
individuals differ in their volatility parameters and those volatilities are persistent, 
then individuals with large past income changes will be more likely than others to 
have large subsequent income changes. 

This possibility is investigated in Table [4] and shown graphically in Figure [T] 
These compare the sample variance of income changes for individuals with and with- 
out large past income changes. In each year, a cohort without large income changes 
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Figure 5: Evolution of Percentiles of Volatility Distribution 
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These figures show the evolution of various percentiles of the posterior mean of the permanent (left) 
and transitory (right) variance for various percentiles of the distribution of variance parameters. 
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is formed as the set of individuals whose measure of variance, either permanent vari- 
ance or squared income change, was below median four years ago; a cohort with large 
income changes is formed as the set of individuals whose measure of variance was 
above the 95 th percentile four years ago. This four-year period is chosen so that 



income shocks are far enough apart to be uncorrelated. (Abowd and Card, 1989) 

Note that individuals with large past income changes tend to have larger sub- 
sequent income changes. The tendency to have large income changes is persistent, 
which indicates that some individuals have ex- ante more volatile incomes than others. 

The divergence over time in volatility between past low- and high- volatility cohorts 
is clear in both Figure [T] and Table |4j The magnitude of income changes has been 
increasing more for those with large past income changes (who are more likely to be 
inherently high- volatility) than for those without such large past income changes (who 
are not). This increase in volatility falls primarily on those who could be expected to 
have volatile incomes to begin with. This shows that the increase in volatility among 
the volatile we find in the model cannot be attributed to increasingly fat-tailed shocks 
for everyone. 



4.4 Whose incomes are volatile? 

In this paper, we have identified increasing volatility for men in the U.S. since 1968 
as being driven solely by the right (volatile) tail of the volatility distribution. Here, 
we examine the attributes of men with highly volatile incomes. 

Table [7] presents the results from a probit regression to predict whether a person- 
year estimate of the (posterior mean) volatility parameter is above the 90 th percentile 
for that year. Note from the first row that self-employed individuals are much more 
likely to have highly volatile incomes. The second row shows that "risk tolerant" 
individuals are also much more likely to have highly volatile incomes. Risk tolerance is 
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Table 7: Determinants of High Income Volatility (Probit) 



Dependent 


Permanent 


Transitory 


Variable 


Variance 


Variance 


self-employed? 1 or 


0.6001 


0.7794 




(24.07)*** 


(32.22)*** 




[0.1085] 


[0.1533] 


risk-tolerant? 1 or 


0.1303 


0.0950 




(5.91)*** 


(4.31)*** 




[0.0180] 


[0.0131] 


age 


0.0104 


0.0082 




(7.82)*** 


(6.20)*** 




[0.0014] 


[0.0011] 


years of education 


-0.0041 


-0.0123 




(-0.89) 


(-2.67)*** 




[-0.0006] 


[-0.0017] 


income>median? 1 or 


-0.2277 


-0.2922 




(-9.84)*** 


(-12.65)*** 




[-0.0308] 


[-0.0398] 


have children? 1 or 


-0.0498 


-0.0686 




(-1.48) 


(-2.04)** 




[-0.0068] 


[-0.0094] 


number of children 


0.0120 


0.0068 




(0.90) 


(0.51) 




[0.0016] 


[0.0009] 


married? 1 or 


-0.1009 


-0.1815 




(-3.00)*** 


(-5.56)*** 




[-0.0143] 


[-0.0270] 


R 2 


0.0469 


0.0751 


observations 


31,898 


31,898 



Results from a probit regression to predict an indicator variable for whether posterior mean variance 
(permanent or transitory volatility) estimate is is above the 90 th percentile for that year. "Risk 
tolerant" is set to 1 if the PSID risk tolerance variable exceeds 0.3. Above-median income indicates 
that four-year lagged income is above-median for that (lagged) year. *, **, and *** indicate 
significance at the 10%, 5%, and 1% levels, respectively, z-statistics are in parentheses. Marginal 
effects are in square brackets. 
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identified from answers to hypothetical questions about lotteries, designed to elicit the 
individual's coefficient of relative risk- aversion; risk-tolerant individuals are defined 
as those with an estimated coefficient of relative risk-aversion below 1/0.3. 

High income individuals (those with incomes above median four years before the 
observation in question) are less likely to have volatile incomes. Individuals with more 
years of education are also less likely to have volatile incomes. Older individuals are 
more likely to have volatile incomes, a result driven by the large number of high- 
volatility individuals between ages 50 and 60. Unsurprisingly, men who are married 
and/or who have children are less likely to have volatile incomes. 

4.5 Whose incomes are increasingly volatile? 



Section |4.4 identified attributes of individuals with volatile incomes. In particular, 



the self-employed and those whose answers to survey questions suggest they are risk- 
tolerant are more likely to have volatile incomes. Here, we examine the increase in 
volatility over time among these groups. 



Table [8] predicts the posterior mean variance (volatility) estimates described ear- 
lier with a linear time trend. The "change" row shows the coefficient on calendar 
time; the "percent change" row shows the expected percent change over the sample 
implied by this coefficient. The top panel presents results for the permanent variance; 
the bottom panel presents results for the transitory variance. Each column presents 
results for a different sub-sample. By comparing the first two columns, note that that 
volatility has increased dramatically more for self-employed people than for others. 
These individuals have much higher average levels of volatility, but their percentage 
change in volatility is still higher than for other individuals. Self-employed individ- 
uals account for a substantial proportion of the overall increase in income volatility. 
Similarly, the increase in permanent volatility (the variance of permanent shocks) 
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Table 8: Volatility Trends by Self-Employment, Income, and Risk Tolerance 



Permanent Variance 





Self- Employment 


Income 


Risk Tolerance 




self- 


not self- 


> med. 


< med. 


risk 


not risk 


sample 


employed 


employed 


income 


income 


tolerant 


tolerant 


change per year 


0.0048 


0.0011 


0.0018 


0.0009 


0.0035 


0.0012 


% change '68-'05 


194% 


58% 


135% 


36% 


172% 


76% 




(6.17)*** 


(4.58)*** 


(5.99)*** 


(2.75)*** 


(4.61)*** 


(4.50)*** 


N 


6,068 


41,766 


10,336 


23,876 


23,958 


18,029 


Transitory Variance 




Self-Employment 


Income 


Risk Tolerance 




self- 


not self- 


> med. 


< med. 


risk 


not risk 


sample 


employed 


employed 


income 


income 


tolerant 


tolerant 


change per year 


0.0262 


0.0061 


0.0040 


0.0116 


0.0100 


0.0076 


% change '68- '05 


176% 


101% 


125% 


101% 


117% 


114% 




(11.27)*** 


(13.80)*** 


(10.84)*** 


(13.22)*** 


(7.81)*** 


(9.45)*** 


N 


6,068 


41,766 


23,876 


23,958 


10,336 


18,029 



Results from a weighted OLS regression to predict the posterior mean variance (volatility) estimate 
with a linear time trend. The "change" row shows the coefficient on calendar time; the "percent 
change" row shows the expected percent change over the sample implied by this coefficient. This 
is (100 percent) times (2005 minus 1968) times (the coefficient on calendar time) divided by (the 
average posterior mean in the sample). The top panel presents results for the permanent variance; 
the bottom panel presents results for the transitory variance. Each column presents results for 
a different sub-sample. "Risk tolerant" means that the PSID risk tolerance variable exceeds 0.3. 
Above-median income indicates that four-year lagged income is above-median for that (lagged) year, 
t-statistics are in parentheses. 



is much greater for those who self-identify as risk tolerant (those whose estimated 
coefficient of relative risk aversion less than 1/0.3) than those who do not. Transi- 
tory volatility does not show major differences in trend for risk tolerant and not risk 
tolerant individuals. 

Table [8] shows that the increase in volatility is apparent throughout the income 
distribution. While increases in the average variance of transitory shocks are sim- 
ilar (in proportional terms) for those with above- and below-median income, the 
variance of permanent shocks has increased more for those with above-median in- 
come than for those with below-median income. While below-median individuals 



are over-represented among those with the highest volatilities (Section 4.5), low in- 
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Table 9: Volatility Trends by Age and Education 



Permanent Variance 





at 






education 






less than 


at least 


more than 


high 


less than 


sample 


40 yrs old 


40 yrs old 


high school 


school 


high school 


mean change/year 


0.0006 


0.0018 


0.0024 


0.0005 


0.0004 


% change '68-'05 


44% 


76% 


120% 


28% 


22% 




(3.66)*** 


(4.55)*** 


(6.08)*** 


(1.71)* 


(1.17) 


median change/year 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


% change '68-'05 


0% 


0% 


1% 


0% 


0% 




(0.79) 


(4.29)*** 


(5.20)*** 


(-1.36) 


(0.20) 


95 th %tile chnge/year 


0.0007 


0.0010 


0.0008 


0.0007 


0.0012 


% change '68-'05 


53% 


67% 


63% 


55% 


72% 




(8.35)*** 


(6.47)*** 


(10.32)*** 


(6.50)*** 


(2.31)** 


N 


23,928 


23,906 


23,455 


15,516 


8,863 


Transitory Variance 






;e 




education 






less than 


at least 


more than 


high 


less than 


sample 


40 yrs old 


40 yrs old 


high school 


school 


high school 


mean change/year 


0.0057 


0.0096 


0.0093 


0.0065 


0.0066 


% change '68- '05 


86% 


123% 


120% 


102% 


95% 




(9.36)*** 


(13.27)*** 


(12.14)*** 


(8.76)*** 


(6.91)*** 


median change/year 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


% change '68- '05 


1% 


2% 


2% 


2% 


3% 




(6.87)*** 


(18.73)*** 


(11.18)*** 


(13.69)*** 


(7.60)*** 


95 th %tile chnge/year 


0.0378 


0.0649 


0.0598 


0.0483 


0.0467 


% change '68- '05 


124% 


211% 


183% 


188% 


135% 




(7.87)*** 


(17.10)*** 


(12.15)*** 


(11.04)*** 


(5.78)*** 


N 


23,928 


23,906 


23,455 


15,516 


8,863 



Results from a weighted OLS regression to predict the posterior mean variance (volatility) estimate 
with a linear time trend. The "change" row shows the coefficient on calendar time; the "percent 
change" row shows the expected percent change over the sample implied by this coefficient. This 
is (100 percent) times (2005 minus 1968) times (the coefficient on calendar time) divided by (the 
average posterior mean in the sample). The top panel presents results for the permanent variance; 
the bottom panel presents results for the transitory variance. Each column presents results for a 
different sub-sample, t-statistics are in parentheses. 
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come individuals are not driving the increase in volatility among those with the most 
volatile incomes. 

Table [9] presents results by age and educational attainment. Note that while 
magnitudes vary, the increase in volatility at the right tail is present for those below 
and above 40, and across the education distribution. 



5 Conclusion 

Increases in the size of income changes in the PSID can be attributed almost entirely 
to the "right tail" of the volatility distribution. Taking volatility as a proxy for 
risk, those who would have had risky incomes in the past now face even more risk. 
Everyone else has had no substantial change. 

Without knowing more, the welfare implications of this finding are unclear. De- 
pending on what kind of people have volatile incomes, an increase in volatility at 
the volatile end of the distribution could be more or less bad than an increase in 



volatility for everyone. Consider the possibility (which we refute in Section 4.4) that 
risk tolerance is independent of income volatility or expected income. In this case, 
increasing volatility at the volatile end of the distribution decreases welfare more 
than increasing risk throughout the distribution. When individuals have decreasing 
absolute risk aversion, high levels of income risk (proxied here by volatility) make 



people more vulnerable to additional risk. (Gollier, 2001) If there is a compensating 
differential for risk so that volatile incomes are also higher on average, then this effect 
will be mitigated or reversed. 

This paper shows that those with the most volatile incomes are also the most 
risk-tolerant. In this case, the increase in risk has hit those best able to handle 
it. To the degree that income volatility is chosen (e.g., by choosing an occupation), 
we would expect those with the highest tolerance for risk or the best risk-sharing 
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opportunities to take on the most volatile incomes. If it is these individuals whose 
volatility has increased, it could blunt substantially any welfare costs associated with 
increased income volatility. Since the increase in volatile has fallen disproportion- 
ately on the self-employed, it could also reflect an increase in profitable (but volatile) 
business opportunities. In this case, there could even be welfare gains associated 
with increased income volatility. 
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A Appendix A: Estimation 



We estimate the joint posterior distribution of all unknown parameters conditional 
on our observed data as: 

p(0,7 2 ,w,£,<r 2 |y) ocp(?/|0,7 2 ,a;,e) -p(u},e\a 2 ) ■ p(a 2 ) (2) 

Following Bayes rule, the distribution of parameters given the data 7 2 ,w,e, a 2 \y)\ 

- is proportional to the product of the distribution of the data given those parameters 

- p(y\<f>, 7 2 ,a;,e) - and the probability of those parameters - p(u>,e\a 2 ) ■ pip 2 ). We 
will estimate the posterior distribution of our unknown parameters by Markov Chain 



Monte Carlo (MCMC) simulation, specifically the Gibbs sampler. (Geman and Ge- 



man 



1984 ) The Gibbs sampler estimates the full posterior distribution in equation 
(|2j) by iteratively sampling a value for each unknown parameter conditional on the 
current values of the other unknown parameters. In other words, we iterate over the 
following steps. 

Step 1: Sample new values of (0, 7 2 ) from 7 2 |?/,a;,e,(T 2 ) 

Step 2: Sample new values of (u>,e), the shock parameters for each person and year, 
from p(uj : e\y, 0, 7 2 ,<r 2 ) 

Step 3: Sample new values of (cr 2 ), the volatility parameters for each person and year, 
from p(cr 2 |t/,<^,7 2 ,a;,£). 

These sampling steps form a Markov chain that is iterated until the set of all 
parameters has converged to their joint posterior distribution. This algorithm is pro- 
grammed in Python and run on a grid cluster of computers. One run of this model 
(with 10,000 iterations) takes several weeks, though multiple runs can be done si- 
multaneously. Each of the runs was started from a randomly sampled set of initial 
parameter values. These multiple runs were used to evaluate convergence of the algo- 
rithm to a reasonable set of samples from the posterior distribution of all parameters. 
The first 5000 iterations of each chain was discarded as the pre-convergence burn-in 
period, and our inference was based upon the remaining sampled values. 

A.l Step 1: Sampling income process parameters (<j), 7 2 ) 

In this step, we take realized shocks (u>,e) as well as excess log income data (y) as 
given, to estimate the rate at which shocks pass through to income (<f>). Reorganizing 
equation ([T|) and setting limits of q w = q £ = 3 (a conservative choice according to 



Abowd and Card, 1989), we get the following dynamic linear model, 

t-3 t t 

k=0 k=t-2 k=t-2 

For each individual i, the dynamic linear model for their excess log income (y») is a 
combination of the homogeneous parameters (<f>) and realized shocks (u}.£). In our 
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Gibbs sampling model implementation, we take advantage of the fact that sampling 
new values of the homogeneous parameters conditional on fixed values of the realized 
shocks is relatively simple, and vice versa. 

If we are given values of the realized shocks (u>i,£i), we can calculate the scalar 
y* t and the 1x6 (since + q e =G) vector X ijt , 

t-3 

Vi,t = Vi,t ~ ^J^i.ifc Xi,t = {^i,t-2-, ^i,t-l, ^i,ti e i,t-2i £ i,t-l> e i,t) 

k=0 

Let y* be the N(T — 3) x 1 vector of all y* t across individuals i and time t, and let 
X be the N(T — 3) x 6 matrix whose rows are all X i t across individuals % and time 
t. We can then write equation (]3| as a simple linear regression model, 

y* = X ■ I e where e ~ Normal(0, 7 2 • I) 

where = (^ w ,2> <j>u,\t 4>u,o, <f>e,2, 4>e,i, 4>e,o) are the homogeneous parameters of interest. 
Note that this is the stage at which we use measurement error (e) as distinct from 
transitory shocks. 

We use non-informative prior distributions for both j 2 and /?, which leads to the 
following posterior distributions (the Bayesian analog of a least-squares estimate): 



(tn (jf-xpytf-xp) 



7 ~ Inv — Gamma . 

V 2 

~ Normal ^, 7 2 ■ (X'X)- 1 ) (4) 

where P = (X'X) l X'y* as in a least-squares regression. We sample new values 
of 7 2 and <f> from the distributions in Q, but with the additional constraint that 

Efc^ = L 

A. 2 Step 2: Sampling realized shocks (o;,e) 

In this step, we take excess log income data (y), the homogeneous parameters (</)), 
and the volatility parameters (a 2 ) as given. We use these to sample realized shocks 

If we are now given values of the homogeneous parameters ((f)), then the only 
unmeasured variables in our dynamic linear model ^ are the realized shocks (Ui,£i). 



We use maximum likelihood estimates from a Kalman filter (Kalman 1960) to sample 
new values of the realized shocks (ojj,£i), as outlined in Carter and Kohn (1994). 
Given the homogeneous parameters (<^, 7 2 ) and the collection of volatility parameters 
(cr 2 ), each individual's income process is independent, so run the Kalman filter and 
sampling procedure for the realized shocks (u)i,£i) for each individual i separately. 
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A. 3 Step 3: Sampling volatility parameters (cr 2 ) 



In this step, we take sampled realized shocks (u,e) as given and use these to sample 
estimates of volatility parameters (o -2 ). In order to sample a full set of volatil- 
ity parameters <x 2 from the distribution p(cr 2 \y,^), r y 2 ,u,e), it is easiest to proceed 
sequentially by sampling (one-by-one), the volatility parameters of t for individual 
% and year t from the distribution ]>(cr 2 \y. 0. 2 . u;. £. c 2 .^) ) • Note that under this 
scheme, information about (of J comes from our sampled permanent and transitory 
shocks (ui^e^t) as well as our current estimates of the volatility parameters, ff -a t t) > 
from other years within the individual as well as other individuals. We link these 
other volatility values cr 2 ^ ^ to our shock parameters (u; ijt , e^) through the posterior 
distribution, 

K^K*> £ M>°^-(M)) <xp(ui,t,£i,tWi,t) ■ P( a lt\°^(i,t)) ( 5 ) 

The first term of equation ^ comes from the likelihood of our realized shocks 
(uJi ; t, £i,t) from our dynamic linear model, 

|2\ II 2 \— ~ f ^- ^it 1 ^it \ 

pK*,e i)t |<T ijt ) oc [a^ i)t a eM ) 2 exp ~ (6) 

V A<J uj,i,t Z<J e,i,t/ 

The second term of equation ^ is our Markovian hierarchical Dirichlet process 



(MHDP) prior, p(of Jcr 2 ^), described in Sections 3.2 and 3.3 Sampling new values 
af t from the posterior distribution (J5| is a multi-step process that acknowledges the 
structure of our population. First, we sample a volatility parameter proposal value 
(cr 2 = {Vu*i a £-k}) fr° m a continuous distribution /(•). For our implementation, we 
used an inverse-Gamma distribution, which is commonly used for variance parame- 
ters. We will set af t = cr 2 only if we cannot find a suitable af t G &-u t t) i- e - among 
our currently existing values in the population. 



A. 3.1 Level 1: Is volatility unchanged from last year? 

We first consider the posterior probability that af t = uf t _i, 

P(oi,t = a i,t-i) « Qi,fP(ui,t,£i,t\(rlt-i) ( 7 ) 
K°f,t + a lt-x) oc 9 -p{uji^e i)t \al) (8) 

Recall that Qij is the number of consecutive years with parameter values cr 2 i _ 1 ; 9 is 
the prior tuning parameter for Level 1. We compare the posterior probability that 
volatility values are unchanged from last year {<J 2 t _i in equation fl7|) to the posterior 
probability that volatility values are equal to the proposal value (o~ in equation rt8j). 
We sample a possible value for af t from this posterior distribution, either cr 2 < _ 1 or cr 2 , 
where choice is made stochastically by flipping a weighted coin with weights equal 
to the probabilities in equations (17]) and (|8j). If this weighted coin flip selects cr? t _ 1: 
then we set af t = cr 2 t _ v If the coin flip selects cr 2 , we do not set af t = crf^ and 
instead proceed to Level 2 to find af t . 
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A. 3. 2 Level 2: Is volatility the same as in another year? 

Given that we did not choose to set af t ^ crft-i, we consider the posterior probability 
that af t G of_ t . If there are Li unique values of G of _ t , the posterior probability 
that af t = of. is, 

P(°i,t = °i) « ^ • P(^i,t> £i,t\°i) l = !,■■■, h (9) 
P«^<-t) oc Qi-p(uJi^ei tt \al) (10) 

n« is the number of occurrences of value of. within the set of possible values o^/ it \; Gj 
is the prior tuning parameter for Level 2. We sample one of these Lj + 1 choices by 
flipping a weighted coin with weights proportional to the probabilities above. If this 
weighted coin flip selects of G of t , then we set of t = of. If the coin flip selects of, 
we do not set af t = of. for any of t ^ of t but instead proceed to Level 3 to find af t . 

A. 3. 3 Level 3: Is volatility the same as another person's? 

Given that af t of t , we consider the posterior probability that af t G at^ where oi i 
are the volatility values that currently exist in the population outside of individual %. 
If there are L unique values of G o^, the posterior probability that of t = of is, 



P(°f,t = °f) oc nj •p(w M ,e i>t |o-, 2 ) l = l,...,L (11) 
p{crlti °-(i,t)) oc G-p(u; M ,£ M |of) (12) 

n; is the number of occurrences of of within the set of current volatility values over 
all people other than person i\ G is the prior tuning parameter for Level 3. We sample 
one of these L + 1 values by flipping a weighted coin with weights proportional to 
the probabilities above. If this weighted coin flip selects of G of^ t y then we set 
erf t = erf. If the coin flip selects of, we set crf t = of. of represent new volatility 
values that have not yet been seen in the population. 

The three steps outlined above result in a sampled volatility value af t for person 
i and year t, conditional on the other volatility values (r\ it y We can repeat this 
procedure for all other years and individuals to update our full set of volatility values 
a 2 . 
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